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Research investigations in the realm of micro-robotics often center around 
strategies addressing the multi-robot task allocation (MRTA) problem. Our 
contribution delves into the collaborative dynamics of micro-robots deployed 
in targeted hostile environments. Employing advanced algorithms, these robots 
play a crucial role in enhancing and streamlining operations within sensitive 
areas. We adopt a tailored GREEDY approach, strategically adjusting weight 
parameters in a multi-objective function that serves as a cost metric. The 
objective function, designed for optimization purposes, aggregates the cost 
functions of all agents involved. Our evaluation meticulously examines the 
MRTA efficiency for each micro-robot, considering dependencies on factors 
such as radio connectivity, available energy, and the absolute and relative 
availability of agents. The central focus is on validating the positive trend 
associated with an increasing number of agents constituting the cluster. Our 
methodology introduces a trio of micro-robots, unveiling a flexible strategy 
aimed at detecting individuals at risk in demanding environments. Each micro- 
robot within the cluster is equipped with logic that ensures compatibility and 
cooperation, enabling them to effectively execute assigned missions. The 


implementation of MRTA-based collaboration algorithms serves as an 
adaptive strategy, optimizing agents' mobility based on specific criteria related 
to the characteristics of the target site. 
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1. INTRODUCTION 

Robotic technology has the potential to replace human workers in high-risk scenarios, reducing the 
potential for human and financial loss. Recent research has led to progress in developing autonomous 
techniques and devices capable of operating without direct human intervention, including working online and 
collaborating with humans in hazardous environments. A major focus of our research is the study of the 
optimal number of robots per task of a cluster (multi-robot teamwork) under specifications and particular 
potentials. The primary objective is to refine control methods for multi-robot in hostile areas. The importance 
of this research is underlined by its significant professional benefits and potential applications. Micro-robots 
are expected to play a key role in improving task distribution and execution, as well as facilitating the 
transmission of relevant information and collected data [1]—[4]. The challenging transmission process makes 
use of affordable means and logic strategies, such as radio communication, within the cluster. 
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To facilitate simulation, we have chosen to utilize the open-source robotics operating system (ROS) 
solution along with its associated tools and modules [5], [6]. ROS will streamline the process of translating 
both conceptual and physical models into reality. An illustrative instance within this field has been developed 
by [7]. 

Multi-robot task allocation (MRTA) problems are intricate case studies. In operational research 
(OR) and for NP-class problems characterized by numerous constraints and a single objective function, 
special approaches are used, such as differential methods and AI-based algorithms. Our decision-making 
process adheres to specific criteria to adopt a proactive and constructive approach: i) form and structure 
(dimensions, degrees of freedom), ii) data transmission (means, persistence), iii) economy and energy 
autonomy (energy security), iv) equipment carried out as payload (sensors), and v) embedded logic. 

The collected data is vital for the cluster. Each agent undergoes an evolution process through a 
collaborative treatment approach. Our goal is to ensure efficient completion of the task in the shortest time, 
while maintaining the importance of the collected data and utilizing dedicated resources. 

The main objective addressed by Rekleitis et al. [8] is the optimization of the generated map by 
minimizing errors at the estimation level, contingent on the robot's capabilities and the attached payload. 
Task coordination for optimal multi-robot evolution through a task-based multi-robot task allocation (MRTA) 
optimal assignment problem (OAP) is emphasized by Gerkey and Matarić [9]. Additionally, Lee et al. [10] 
focus on developing a metric for estimating fault levels within a swarm of robots, while Zhang et al. [11] 
introduce a model based on data correlation, specifically the correlated random walk model, to efficiently 
approximate task searching time for distributions of multi-robot systems in large arenas. 

Recent literature, including [12] and [13], explores bio-inspired techniques for collaboration and 
sharing state information between pursuing agents and fast evaders. Collaboration necessitates meeting 
specific criteria, such as sharing each agent's state information and utilizing onboard resources to complete 
tasks. The context of a hostile site adds complexity, demanding strategies with mathematical intricacy [14], 
[15], influenced by ecosystem characteristics (multi-robot setups, tasks, region of interest, and resources). In 
the case of a hostile site with intricate morphology, the challenge intensifies, relying on capabilities like 
finding a direct path. The objective is to derive optimal solutions [16] for the routes to be followed, 
minimizing time to reach the rescue target [17]. 

In the context of industrial sites, the focus shifts dramatically, with an emphasis on completing 
surveillance in the most relevant manner for incident localization [18]. Other research concentrates on the 
capabilities that micro-robots must possess to acquire cognitive abilities, enabling them to navigate and 
evolve on the study site using reinforced learning methods [19]. Regarding morphology, concrete examples 
like MIT's Cheetah 3 [20] or ANYBOTICS' ANYmal from ETH Zurich [21] meet the requirements of 
degrees of freedom in challenging sites with heightened aggressiveness. 


2. METHOD 

Various tools, including proprietary OnShape assembly and open-source Phobos add-on in blender, 
are utilized for preparatory tasks before simulating under ROS. XACRO, simulation description format 
(SDF), or unified robotics description format (URDF) scripts are essential for developing 3D robot models 
with joints for Gazebo/RVIZ simulation. The JETBOT prototype by NVIDIA serves as the modeled micro- 
robot with updated URDF (SDF) to fit study needs. Elements at this stage are crucial for setting up a 
compatible simulation scene in line with our approach: 

— The actual space is defined by a specific location (the dimensions of an apartment with spatial 
limitations) in 3D [5], [10], [18]. 

— The proposed cluster of multi-robot is a formation of three similar agents [22]. 

— Possibility of the heterogeneous case [23]-[25]. 

E Target, an injured human (3D model of a human, target behavior complexity). 

The standard packaging used in this research are as follows. i) Economy and energy autonomy for a 
given payload (implementation of ROS battery plug-in as shown in Figure 1 within the URDF joints with its 
compatible compiled library). ii) Ensure stability in communication links using the ROS radio connectivity 
plug-in with the cluster. iii) Enhance collaborative capabilities for improved performance by implementing 
necessary plug-ins, rules, and algorithms, as illustrated in Algorithm 1. iv) Algorithm enhancements-based 
Al (AI Logic code integrated). v) Minimum agent's size and volume (for swarm extension purposes). 

The scene requirements in this research are: i) The cluster incorporates three micro-robots (agents) 
denoted as URjlic{1,2], each equipped with motorized wheels for locomotion (differential, driver). ii) Every 
agent unit is furnished with transmission capabilities for data transmission and maintaining communication 
links. iii) The autonomy of every agent is supported by an onboard battery, providing sufficient power for its 
activities within the cluster. iv) Every agent possesses an integrated AI algorithmic foundation, empowering 
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it to make decisions for both collaborative and individual purposes. v) Every agent unit within the cluster is 
outfitted with essential equipment, including a Camera, LIDAR (light detection and ranging), a motor driver, 
actuators, and sensors as shown in Figure 2. vi) The micro-robot group operates in two modes: Supervised 
mode and collaboration mode. 
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Figure 1. Nominal current discharge characteristic (0.65A) 


Algorithm 1. MRTA, Greedy oriented search algorithm 


Begin 
Initializations: 
Let SEA @ 
Let Kọ =- k’ {V k" € [1;R] , R: number of robots} 
Let Tep 
Let Pk = {ey get all previous tasks if they exist 
Ø otherwise 


for (k E [1; R]) do 
Sk= area (k) 
if (k # Ko) then 
Let SKo — (§Ko— gko N sk) 


end if 
end for 
T — task (Ko) |sko 
iel 
while (i< |T|) do 
ie Eval (f° sko) {ith objective function} 
: Ko 74K K 
if (T,°(t,°)C P°) then 
te « None 
end if 
i e itl 
end while 
TK © T? lgo gesto) {for j E€ [l;m], m is set of specific tasks} 


if (T +@) then 
p“ oe pe + TKo 
else 
T- Ø 
for (k E [1; R]) do 
if (k # K,.)do 
T — T + task (k)|skee x, 
iecl 
while (i<|T|) do 
tř- Eval Flee) 
if (TK(tk) c PK) then 


tk — None 
end if 
i« itl 
end while 
Tk o T} lsk pese(ehy a = Ko {for j € [l;m] and m 
end if is set of specific tasks} 
end for 


TX — Best (T*) | kE(1,2,..,R} 
pk = PE + TÉO | xe (1,2,..,R) 
end if 
End. 
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Figure 2. AL Mustaksheef3D, wheeled robot developed 


We aim to effectively locate and aid victims in challenging situations through Collaborative MRTA. 
Our approach involves the implementation of 3D digitization utilizing LIDAR or a depth camera, ensuring 
optimal outcomes for Search and simultaneous localization and mapping (SLAM) purposes. The ROS 
environment serves to manage the context, and, in addition to interoperability, ROS can engage with various 
platforms dedicated to simulating MRTA problems [26]-[30]. The ROS.MSG embedded module facilitates 
message exchange among micro-robots, with inertial measurement unit (IMU) data conversion to ODOM 
(odometry) enabling the validation and voting of agent moves in areas that are still unoccupied as shown in 
Figure 3. 


Service 
invocation 


Publication Topic Subscription 


Figure 3. ROS basics and concepts 


2.1. Distributed MRTA use case and problem statement 

Achieving coordinated teamwork among robots with real-time task distribution necessitates a 
decentralized framework [31] that covers resilient robot awareness, low-level motion control, and high-level 
task scheduling [32], [33]. Effective location management in multi-robot networks is critical for 
collaboration, where decision-making and communication play pivotal roles in mission distribution, 
presenting substantial challenges in data exchange among robots and the operating station [34]. In 
approaches that are self-organized and decentralized, individual robot nodes independently make decisions 
with limited regard for other agents [35]. Decisions in these approaches are frequently influenced by natural 
or real-world phenomena, as highlighted in [3], including bees swarm, market strategy, swarm intelligence, 
and ant colony. These inspirations give rise to complex collective behavior arising from local interactions 
among numerous agents with straightforward behaviors. In such methodologies, sensors play a pivotal role 
by actively collecting local knowledge for sharing within the cluster, as emphasized by Stolzle et al. [7] and 
Ball et al. [36]. The collaborative use of sensors facilitates the accumulation of necessary knowledge 
pertaining to an overarching goal in these procedures as shown in Figure 4. 

Robots need to be able to understand the tasks they have to perform by collecting data through 
sensors and using specific code to make decisions. Broadband, range, power, and data rates are critical to 
system performance. Point-to-point communication is the most basic form, and the choice of transmission 
medium is a function of the type of information being exchanged. Robot evolution on hostile sites can, in 
some manner, be assumed to be a progression of the vehicle forming a path on a surface contoured by a set of 
N points in a space defined in a plane delimited by a closed polygon, where P = P}, Po, ..., Py define a poly- 
point or a set of N points as shown in Figure 5. 
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Figure 4. Cluster architecture 


Ps 


Figure 5. N-gon of a scanned area 


Normally each point is located on the plan (T) by its Cartesian coordinates x; and y;. A one-line 
equation can be written in the form of (1). 


y=ax+b (1) 
a and b are two parameters related to the j® line (slope and y-intercept) in the Polyline defined by P 


components. By using each couple of points coordinates, the related a and b parameters are obtained 
according to (using Cramer's rule). 


ese’ 
Oe ee, (2) 
_ X12 — X21 
b = 2 41 (3) 
X1 — %2 


So, (1) can be like follows for a line equivalent (L1) between two points P, (x1, y1 ) and P, (x2, Y2). y is given 
by (4). 
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X4SX<X2,VxX4-X2<0 
yı — Y2 aga X12 — X2V4 |x2s8x<x1Vx1-x2>0 


yY = 4x1 — xX X4 — X2  [Y1SY<y2Yy1-y2<0 (4) 
Y2SY<Y1YY1-Y2>0 
0 otherwise 


Now, we need to construct a Polygon equation using a combination of multiple-line equations. We have N 
points, which implies that the number of line equations is N (L4, L>, ...., Ly). The n-gon's (polygon) formula 
is given by (5). 


N-1 
Y=Y +Y, + gh y=) thy (5) 
i=1 


Witch is Y, Yo, ...., Yy or simply Y;, Yy where i E N natural strictly positive number and, the set Y, with i € N 
represent each it’ line’s equation: 


XjSX<Xj44,VXj-Xj44<0 
Vi — Vi+1 yu XiYi+1 T Xia Vi |xin1sx<xp~Vxji-xj41>0 
Y; = Y = 4 Xi — Xiz1 Xi — Xia,  [ViSY<Yi+YYi-Yi+1<0 (6) 
Vit1SV<ViVVi-Vi41>9 
0 otherwise 


Equation valid for 1 > i = N — 1, and the last line (Ly): 


X1 SxX<XN,VX4-XN<O 
Yı — Yn gil X1YN — Xn V1 |xnsx<x1,Vx1-xy>0 

Yy =V=4x, — Xy Xi — Xy  [y1sy<ynYy1-yN<0 (7) 
YNSY<Y1i.V¥1-YN>0 


0 otherwise 


For x,y,a@ E€ Rand f(x) = y = a, The area inside the irregular polygon can be defined as the result of: 


Areay = | elayaeara(ew't att (8) 
R 


where x”, x” +1ey(x) 


Consider expressions like n = 2m + 1|men, Where Xn<Xn+1, and y(x) representing the variation domain of 
the variable x. Consequently, the explored area by the kth robot can be denoted as Area(k), where k=1,2, ..., 
R, and R is the total number of cluster agents. We introduce the relation g(a,x), which satisfies the condition 
SOD =A=V= fx" x' xx}, This relation, denoted as g, aids in determining whether the robot k is 
situated within the designated Area(k) or not. To assess whether a robot is outside or inside an area, we can 
analyze four zone-shaped situations illustrating the most probable cases, as depicted in Figures 6(a), 6(b), 
6(c), and 6(d). 

In the first scenario as shown in Figure 6(a), robot; is situated in zone Z, while the other robots 
(robotz and  robot3) are not. Formulating this situation involves expressions such as 
81(0.1,X)=82(02,x)=Z{x',x"} and g3(a,x)= ø . Therefore, by comparing xp, with x’ and x", we can draw the 
following conclusions: 
— If gx(a,x)=0 > the robotk is outside zone Z. 
—  ITfx'Sxp, <x" and gk(a,x)# = the robotk is within zone Z. 
— If Xp, <x' or x"<xp, and gk(a,x)#@ > the robotk is outside zone Z. 

In the second scenario as shown in Figure 6(b), we have g7(a1,X)=2(02,X)=93(03,.X)=Z{x',x.x x} 
robot is within the area where x’ < xg, < x”, while the others are not. Specifically, x” < xp, < x" and 
Xp, < x’, leading to the following determinations: 


- Ifx” is S gre omıivmen aul 8i(a,x)#0 > the robot;is within the area. 
n=2m+1,vm 


n n+1 ; m 
= > a 
If x” < xp, <x METE NS and gi(a,x)#o >the robot, is outside the area 


The third situation as shown in Figure 6(c) encompasses two singular cases, where 
gı(anx)={x',x",x"} and g2(a2,x)={x'}. The robot's presence in the area can only be determined in these cases 
where |g;|=3 for robot; and |g;|=1 for robot: : 

— Ifx’ Ss xp, <x" and gi(a,x)4o where |gi|=3 = the robot; is in the area. 
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—  Ifx’ = xp, and gi(ax)#o where |gi|=/ = the robot; is in the area. 
In the last case as shown in Figure 6(d), it is impossible to ascertain whether the robot is in the zone 
or not using the relation gg, where g:(a,x) #0, and | g¢|=n|n=2m4+5, vm en. 


4 AY e 
JA: robot position JA; robot position 


2 S| 
as U3 H 
o ES a || Ce ley Se — a 
Oy R H | 
| Pepe 
t | j | i 
ae = | | H 
a i | | 
j ij i p+ a 
A ppb | Per TT Ff | P? 
as | | d E lst Lel Le faw» x 
0 | tyr H i i 0 L ix 4 es ; x {Xx > 
Xoz Xar Xps Xg i Xr! IXa: | | 
(a) (b) 
Y ; J: robot position 


Py } Í Pa 


® 


Figure 6. Potential scenarios for the robot's posture (a) simple case with 3 robots, (b) complex case with 
3 robots, (c) singular case, and (d) insoluble case 


2.2. Simulation cases and success factors 

Before initiating our simulations, we define our context and our environment. To simulate 
collaboration and explore to locate a victim, a workstation is employed. This workstation features an HP 
ProBook x360 435 G7 with an AMD Ryzen 7 PRO 4750U processor, AMD Radeon RX Vega 7 graphics, 32 
GB DDR4 Kingston RAM, and a 1TB Samsung SSD Drive. The simulation utilizes ROS [37] on Linux 
Ubuntu 20.04, enhancing node communication and facilitating message passing between robots and 
computers. Additionally, JetBot, powered by NVIDIA Jetson Nano and accessible at NVIDIA, supports 
sensors and implements RNN for object detection and collision avoidance. Furthermore, JetBot is capable of 
connecting through various radio links. 


2.2.1. Optimization based algorithm 
The next flowchart illustrates the overall process as shown in Figure 7. The cluster collects data 
related to the ROI's context, investigates frontiers around each micro-robot, generates map fragments, and 
shares them within the cluster. If tasks are completed, processes can be disposed of, otherwise, the 
preparation step is repeated. In a practical scenario, each micro-robot needs to meet the optimal condition 
defined by the cost function feos? in (9). 
— Radio connectivity to the sink or to all cluster Fyonap which guarantees data exchange and/or maintain a 
link (AP for access point). 
— Economic and energy autonomy Futon quantifies the battery lifespan for a particular task. 
— Absolute availability Disp", Denotes the 'OK state' of the unit within the cluster, signifying its 
capability to proficiently carry out the task. 
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— The relative availability, denoted as Disp“ 


specific resource crucial for a particular task. 


, signifies the accessibility of the payload, which is a 


oe =k, * Feon/AP + kz * Fauton + k3 * Disp” +k, * Disp /Res 


cost 
where kiı+k2+kz+k4=1 


(9) 


unknown 
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Figure 7. Flowchart of our approach 


The weights k; for i€{1,4] symbolize the contributions of each specific part to the agent's functional 
cost, as defined earlier. Our goal is to discover the optimal outcome (10) for the function fcost for every agent 
in the cluster, representing the best solution. Subsequently, 


t UR; 7 
fe Max| jejari To constraints} (10) 


The cluster's optimal estimate is indicated by its capability to declare an 'OK' status for availability, 
surpassing the minimum required condition for executing assigned tasks. We introduce Fonap as the reduced 
effective availability of at least one link with a predefined access point. If Ft@frepresents the reference 
threshold for a Wi-Fi connection to an access point and Feon(t) signifies the current connectivity level of an 
agent to the access point, then Fconar is defined as the ratio of the instant connectivity Fron(t) to the reference 


threshold F!$f, This is expressed as (11): 

Feon/ap = 100 * (Fcon(t)/Fécn (11) 
Therefore, Fauton signifies the battery autonomy, indicating the remaining energy in Ah needed by the battery 
to provide adequate power for the agent to successfully accomplish the assigned task as a singular unit within 
the cluster. Here, we represent t as the estimated time required for the assigned task, J as the actual 


discharged current of the battery in Ampere (A), and C as the battery capacity, acting as a current source for a 
specified duration in Ampere-hour (Ah). Hence, the expressions are as (12). 


Fauton = C/T * A/T — 10/Penarge) (12) 
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In estimating Disp" and Disp“, the availability of a cluster member is assessed based on its effective 
suitability for any given task. Relative availability for a resource indicates the agent's capacity to have the 
specified resource (payload) ready at the designated time when a task is assigned. On the other hand, absolute 
availability encompasses the sum of all relative availabilities (of N,es resources), signifying that all of the 
agent's resources are ready for use. The relative availability is set to 1 when there is positive feedback upon 
querying the resource and 0 otherwise. Therefore: 


Lif the targeted resource is available 


Pret/res = { 0 otherwise ue 
Absolute availability is given by (14). 
Nres 
Disp" = ( )(Dperjres') (14) 
i=1 


The agent's ability to fully intervene in a task depends on the logic AND connection of all relevant 
availability functions. The weighting coefficients k; are selected using various methods to optimize the 
objective function's final result. We adopt a bio-inspired approach to determine these coefficients. 


2.3. Collaboration by greedy algorithm for distributed MRTA 

We introduce a greedy algorithm in Algorithm 1 [3] designed for the MRTA problem with 
broadcast messaging. This algorithm is configured to allocate agents based on the optimal assignment to a 
task that maximizes the performance-to-cost ratio (V//di*). Here, V# represents the performance of the k” 
agent for the accomplishment of the i” task, and d* denotes the Euclidean distance between the agent and the 
target/assigned location (8) and (9). 


di = y &i = x)? + (Yi = Yid? + (zi = 2)? (15) 


TK = max(V¥/dř) = max(Vřnř) with nf = 1/dř (16) 


Here, task i is the chosen task for the k” agent from the entire set of potential tasks, and T represents the set 
of viable tasks within the k” agent's scope out of the total M available tasks. The algorithm, identified as 
Algorithm 1, operates as follows: Initially, we define surface variables explored by each robot;, where Sx|xer 
denotes the set of boundary points of the presently scanned surface. Additionally, f|, represents an objective 
function (cost function) for each robotų, with task; being the sub-mission stored in 7 Furthermore, T; is the 
collection of tasks that have not been assigned yet and are included in T, representing the total available tasks 
(T; ET). Concurrently, T* for k € R is the selected task for the robot, and P denotes the locations that have 
already been assembled. Here, k indicates the robot that needs to update its bid at the current stage. 

At the start, no tasks are assigned, so T*=ø for all robots kE R. In each step, one task is allocated to 
a single robot independently as shown in Figure 8, following decentralization. Thus, we need |T| steps, 
the number of tasks for the robot, to complete its state. At each iteration i, after removing conflicting parts 
with areas explored by other robots, all robots k € R submit an offer (t, T*). Each robot k selects task 7* 
from the list of non-located tasks TŹ to maximize its objective function f*. Upon gathering all the bids, we 
discover a superior optimal gain for the collective objective: the multiplicative success of group F (17). 
Through bidding, we efficiently choose the optimal task pair-robot combination for the greatest overall 
benefit [38], [39]. 


F = Besti | [be a7 
KER 


When a robot is surrounded by other robots, it can be misled about its capabilities, hindering its 
development and exploration. This situation can impede the robot's ability to effectively navigate and 
fulfill its tasks. To address this issue and maintain high performance even when surrounded (7*=¢), our 
objective is to enhance the robot's cost function by utilizing the neighboring robots’ available spaces until 
it can operate independently. By incorporating information from surrounding robots, the robot can make 
more informed decisions and adapt its behavior, accordingly, ultimately improving its overall performance 
and autonomy. 
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Figure 8. JetBot during domain exploration 


3. RESULTS AND DISCUSSION 

The simulation scenario is illustrated in Figure 9. It is characterized by various constraints that 
evaluate the efficiency and reliability of our approach in navigating a cluster of micro-robots, both with and 
without collaboration. The simulation scene consists of rooms (isolated spaces) containing furniture 
(obstacles). Micro-robots work together to rescue multiple victims as shown in Figure 10, to minimize time 
and enhance reliability. The simulation illustrates three severity levels involving 1-3 micro-robots as shown 
in Figure 11. 


furniture 


Figure 10. Scene with target location 


Figure 11. Space segmentation 
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We make the following assumptions for the simulation: 
— The scene's domain and dimensions remain constant throughout the investigation. 
— The target position within the scene does not significantly affect the simulation's duration. 
— The agents (micro-robots) are assumed to be homogeneous, possessing identical characteristics. 
— Connectivity, Payload, and Autonomy conditions are satisfactory for all units within the cluster. 

The simulation's outcomes are presented in the subsequent Figures 11 to 15 and Tables 1 to 8. The 
plan involves finding the time to reach each victim, pinpointing the longest time to reach two victims, and 
averaging the maximum values. We focus on the maximum and minimum average durations. The regions in 
Figure 15 show designated areas in RGB colors within the All Area. These areas come from segmenting the 
total ROI using the greedy algorithm. The process is repeated three times. The first iteration involves a single 
robot over ten attempts, and the results are in Table 1. Where time distribution by number of attempts is 
given (with trend function) in the next chart as shown in Figure 16. 


Figure 12. The used robot model 


Figure 14. Robots while performing a mission Figure 15. Agents ROI updates 
Table 1. Simulation time to find the two victims Ges) z -_ 
(in seconds) 140 }—— — — 
Attempts Targetl Target2 To mission's end 120. mE See ` 
1* 55 102 102 100 |— ee @istsss-e- Sn S 
o 
gn 70 30 70 so é a 
3 20 150 150 60 >í PERI 
4 93 35 93 40 IC Se ERE 
| R=0.01S1 
5th 64 86 86 ji 
6" 46 90 90 o 
7° 77 112 112 o 2 4 6 8 10 (tests) | 
gn 95 150 150 
gh 9 76 76 f ; ; 
10" 110 123 123 Figure 16. Duration taken by a single robot to reach 
Mean 63.9 95.4 105.2 the target on the initial attempt 
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The process involves ten simulations, each done three times. Results are assessed across five 
different scenarios (cases I to V). In each test, we note the time to meet all goals until finding two targets 
(victims), with and without considering cooperation. 

— Scenario I: single robot (refer to Table 2). 

— Scenario II: two robots without collaboration (refer to Table 3). 

— Scenario III: two robots with collaboration (refer to Table 4). 

— Scenario IV: three robots without collaboration (refer to Table 5). 
— Scenario V: three robots with collaboration (refer to Table 6). 


Table 2. Simulation time of one robot to find the two Table 3. Two robots without collaboration algorithm 


victims (in seconds) average time (sec) 
targetl target2 duration targetl target2 duration 
Exp. 1 80.2 37.5 90.7 Exp.1 72 38.2 85.3 
Exp.2 67.5 99.5 120.6 Exp.2 72 61.6 95.2 
Exp.3 63.9 95.4 105.2 Exp.3 62.4 40.3 66.9 
Table 4. Two robots with collaboration algorithm Table 5. Three robots without collaboration 
average time (sec) algorithm average time (sec) 
target] target2 duration target] target2 duration 
Exp.1 27.2 21.7 34.2 Exp.1 72.6 32.2 76.5 
Exp.2 33.8 30.4 47.5 Exp.2 57.8 39.3 69.1 
Exp.3 60.1 38.9 41.7 Exp.3 71.2 29 73.9 


Table 6. Three robots with collaboration algorithm average time (sec) 


target! target2 duration 
Exp.1 22.7 7.9 22.7 
Exp.2 18.3 21.5 24 
Exp.3 18.4 14.1 20.7 


Aggregate outcomes are compiled as the min and max values for potential consolidation as shown in 
Tables 7 and 8. Table 7 presents the average time durations across various experimental conditions, offering 
insights into the comparative efficiency of different strategies. In contrast, Table 8 delineates the minimum 
and maximum simulation times required to locate two victims, providing a comprehensive view of 
performance variability under different scenarios. 


Table 7. Time duration average by experiment 
Exp.1 Exp.2 Exp.3 
One Robot 90.7 120.6 105.2 
2NoCollab 85.3 95.2 66.9 
2WCollab 34.2 47.5 41.7 
3NoCollab 76.5 69.1 73.9 
3WCollab 22.7 24 20.7 


Table 8. Simulation time to find the two victims (in seconds) 
One 2NoCol 2WCol 3NoCol 3WCol 
Min 90.7 66 34.2 69.1 20.7 
Max 120.6 95.2 62.1 76.5 24 


Collaboration increasingly impacts the duration of time as shown in Figures 17 and 18. These 
figures vividly illustrate the significant influence of collaboration on time duration, underscoring its growing 
importance in various contexts. Additionally, as the number of agents increases, the duration decreases, 
particularly when a cooperative strategy is enabled, highlighting the efficiency gains achieved through 
collaborative efforts, especially in scenarios involving larger groups. 
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Figure 17. Progression of time duration through Figure 18. Max and min time duration by experiment 
experimentation until the victim is found 


The ROI was evaluated in two ways: by pinpointing the targets and by 3D scanning the area for 
potential reconnaissance. This was done using an enhanced version of a horizontal LIDAR (RPLiDAR 
A1M8) as part of the new features on our AL Moustaksheef3D platform, a robotic payload for 3D SLAM 
currently in development. The enhanced LiDAR as shown in Figures 19 and 20 was tested on a wheeled 
machine and a Drone. The collected data as shown in Figure 21 is provided in point cloud (PCL) of “.pcd” 
format. This model integrates two LiDARs, with the first used for geolocation and the second for 3D vision. 
Scene reproduction yields relative data for estimating facts in the study area. Point-cloud data is used for 
scene reconstruction. Software like CloudCompare, MeshLab, Blender, and Gimp enable background 
processing on point cloud data. The micro-cobot agents integrate APIs into their embedded logic, employing 
robust algorithms for identifying specific targets in hostile environments. Using an MRTA approach with the 
enhanced Greedy algorithm, our strategy optimizes target identification through cluster agent cooperation, 
significantly reducing task time. This collaborative approach requires substantial processor capabilities, with 
basic calculations centralized for collaboration and decentralized for independent agent decision-making 
post-task assignment. 


Figure 19. Figure 20. The 3D model Figure 21. ROI SLAM Segmentation, on 
JetBot/AlMustaksheef3D of the new LiDAR RVIZ and Gazebo 


4. CONCLUSION 

The project aims to design collaborative robots with physical capability and intelligence to work 
effectively in challenging environments. These cobots will help reduce the workload of human response or 
rescue teams by identifying targets. They will also help in areas where the density of robots and technicians 
poses a risk to human safety. 

Our research focuses on micro-robots collaborating with humans, leading to the concept of 
micro-cobots. This aims to reduce payload and minimize bottlenecks in tasks. Increasing micro-robot 
agents enhances investigation speed and target search. The results prove our initial consideration and 
confirm the predefined hypotheses. The results obtained from the simulation explain that the cooperation 
significantly improves the cluster's progress in the search mission. Collaboration prevents erratic 
micro-robot behavior, reduces duration, and prevents system collapse. Fewer agents lead to more 
dispersion and chaos. A moderate number of investigators (7 <Nr<3) is preferable. Swarm techniques with 
collaboration are used to manage the situation. 


Distributed and autonomous multi-robot for task allocation and collaboration ... (Tamali Abderrahmane) 


218 m) ISSN: 2722-2586 


REFERENCES 

[1] F. Montori, L. Bedogni, and L. Bononi, “A Collaborative Internet of Things Architecture for Smart Cities and Environmental 
Monitoring,” IEEE Internet of Things Journal, vol. 5, no. 2, pp. 592—605, Apr. 2018, doi: 10.1109/JIOT.2017.2720855. 

[2] Y. Dong, L. Zhang, and Y. Liu, “Experimental study on control strategy of cobot micro operation force hoisting system,” 
Gaojishu Tongxin/Chinese High Technology Letters, vol. 17, no. 5, pp. 493—497, 2007. 

[3] A. El shenawy, K. Mohamed, and H. M. Harb, “Exploration Strategies of Coordinated Multi-Robot System: A Comparative 
Study,” JAES International Journal of Robotics and Automation (IJRA), vol. 7, no. 1, p. 48, Mar. 2018, doi: 
10.11591/ijra.v7i1.pp48-58. 

[4] L El Mallahi, J. Riffi, H. Tairi, A. Ez-Zahout, and M. A. Mahraz, “A Distributed Big Data Analytics Model for Traffic Accidents 
Classification and Recognition based on SparkMILib Cores,” Journal of Automation, Mobile Robotics and Intelligent Systems, 
vol. 16, no. 4, pp. 62-71, Oct. 2022, doi: 10.143 13/jamris/4-2022/34. 

[5] X. Wang, X. Ma, X. Li, X. Ma, and C. Li, “Target-biased informed trees: sampling-based method for optimal motion planning in 
complex environments,” Journal of Computational Design and Engineering, vol. 9, no. 2, pp. 755-771, Apr. 2022, doi: 
10.1093/jcde/qwac025. 

[6] S. Michieletto, S. Ghidoni, E. Pagello, M. Moro, and E. Menegatti, “Why teach robotics using ROS,” Journal of Automation, 
Mobile Robotics and Intelligent Systems, pp. 60—68, Jan. 2014, doi: 10.14313/jamris_1-2014/8. 

[7] M. Stolzle, T. Miki, L. Gerdes, M. Azkarate, and M. Hutter, “Reconstructing Occluded Elevation Information in Terrain Maps 
with Self-Supervised Learning,” IEEE Robotics and Automation Letters, vol. 7, no. 2, pp. 1697-1704, Apr. 2022, doi: 
10.1109/LRA.2022.3141662. 

[8] I. M. Rekleitis, G. Dudek, and E. E. Milios, “Multi-robot collaboration for robust exploration,” in Proceedings-IEEE 
International Conference on Robotics and Automation, 2000, vol. 4, pp. 3164-3169, doi: 10.1109/ROBOT.2000.845 150. 

[9] B. P. Gerkey and M. J. Matarić, “Multi-robot task allocation: Analyzing the complexity and optimality of key architectures,” in 
Proceedings - IEEE International Conference on Robotics and Automation, 2003, vol. 3, pp. 3862-3868, doi: 
10.1109/robot.2003.1242189. 

[10] S. Lee, E. Milner, and S. Hauert, “A Data-Driven Method for Metric Extraction to Detect Faults in Robot Swarms,” IEEE 
Robotics and Automation Letters, vol. 7, no. 4, pp. 10746-10753, Oct. 2022, doi: 10.1109/LRA.2022.3189789. 

[11] Y. Zhang, D. Boley, J. Harwell, and M. Gini, “A Correlated Random Walk Model to Rapidly Approximate Hitting Time 
Distributions in Multi-robot Systems,” in Lecture Notes in Networks and Systems, vol. 577 LNNS, Springer Nature Switzerland, 
2023, pp. 724-736. 

[12] X. Fu, Y. Zhang, J. Zhu, and Q. Wang, “Bioinspired cooperative control method of a pursuer group vs. a faster evader in a limited 
area,” Applied Intelligence, vol. 53, no. 6, pp. 6736-6752, Jul. 2023, doi: 10.1007/s10489-022-03892-8. 

[13] G. Merkulov, M. Weiss, and T. Shima, “Virtual Target Approach for Multi-Evader Intercept,” in 2022 European Control 
Conference, ECC 2022, Jul. 2022, pp. 1491-1496, doi: 10.23919/ECC55457.2022.9838427. 

[14] A. Giusti, G. C. Maffettone, D. Fiore, M. Coraggio, and M. di Bernardo, “Distributed control for geometric pattern formation of 
large-scale multirobot systems,” Frontiers in Robotics and AI, vol. 10, Sep. 2023, doi: 10.3389/frobt.2023.1219931. 

[15] S. Wang, Z. Li, H. Gao, K. Shan, J. Li, and H. Yu, “Agile Running Control for Bipedal Robot Based on 3D-SLIP Model 
Regulation in Task-Space,” in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and 
Lecture Notes in Bioinformatics), vol. 13455 LNAI, Springer International Publishing, 2022, pp. 505-516. 

[16] P. Fong, C. Fang, and J. He, “Optimal Attack Against Coverage Path Planning in Multi-robot System,” in Lecture Notes in 
Electrical Engineering, vol. 934 LNEE, Springer Nature Singapore, 2023, pp. 1760-1772. 

[17] L. E. Parker, “Multiple Mobile Robot Teams, Path Planning and Motion Coordination in,” in Encyclopedia of Complexity and 
Systems Science, Springer New York, 2009, pp. 5783-5800. 

[18] S.H. Kim, S. G. Yoon, S. H. Chae, and S. Park, “Economic and environmental optimization of a multi-site utility network for an 
industrial complex,” Journal of Environmental Management, vol. 91, no. 3, pp. 690-705, Jan. 2010, doi: 
10.1016/j.jenvman.2009.09.033. 

[19] A. Yahya, A. Li, M. Kalakrishnan, Y. Chebotar, and S. Levine, “Collective robot reinforcement learning with distributed 
asynchronous guided policy search,” in IEEE International Conference on Intelligent Robots and Systems, Sep. 2017, vol. 2017- 
September, pp. 79-86, doi: 10.1109/IROS.2017.8202141. 

[20] J. Di Carlo, P. M. Wensing, B. Katz, G. Bledt, and S. Kim, “Dynamic Locomotion in the MIT Cheetah 3 Through Convex Model- 
Predictive Control,” in IEEE International Conference on Intelligent Robots and Systems, Oct. 2018, pp. 7440-7447, doi: 
10.1109/IROS.2018.8594448. 

[21] M. Bjelonic et al., “Offline motion libraries and online MPC for advanced mobility skills,” International Journal of Robotics 
Research, vol. 41, no. 9-10, pp. 903—924, Jun. 2022, doi: 10.1177/02783649221 102473. 

[22] K. H. Lee, O. Nachum, T. Zhang, S. Guadarrama, J. Tan, and W. Yu, “PI-ARS: Accelerating Evolution-Learned Visual- 
Locomotion with Predictive Information Representations,” in IEEE International Conference on Intelligent Robots and Systems, 
Oct. 2022, vol. 2022-October, pp. 1447—1454, doi: 10.1109/IROS47612.2022.998 1952. 

[23] A. Das, O. Naroditsky, Z. Zhu, S. Samarasekera, and R. Kumar, “Robust visual path following for heterogeneous mobile 
platforms,” in Proceedings - IEEE International Conference on Robotics and Automation, May 2010, pp. 2431-2437, doi: 
10.1109/ROBOT.2010.5509699. 

[24] M. Debord, W. Honig, and N. Ayanian, “Trajectory Planning for Heterogeneous Robot Teams,” in IEEE International 
Conference on Intelligent Robots and Systems, Oct. 2018, pp. 7924-7931, doi: 10.1109/IROS.2018.8593876. 

[25] Z. G. Saribatur, V. Patoglu, and E. Erdem, “Finding optimal feasible global plans for multiple teams of heterogeneous robots 
using hybrid reasoning: an application to cognitive factories,” Autonomous Robots, vol. 43, no. 1, pp. 213-238, Mar. 2019, doi: 
10.1007/s10514-018-9721-x. 

[26] J. Collins, S. Chand, A. Vanderkop, and D. Howard, “A review of physics simulators for robotic applications,” IEEE Access, vol. 
9, pp. 51416-51431, 2021, doi: 10.1109/ACCESS.2021.3068769. 

[27] Y. Yu, Z. Miao, X. Wang, and L. Shen, “Cooperative circumnavigation control of multiple unicycle-type robots with non- 
identical input constraints,’ IET Control Theory and Applications, vol. 16, no. 9, pp. 889-901, Mar. 2022, doi: 
10.1049/cth2.12275. 

[28] C. Calderén-Arce, J. C. Brenes-Torres, and R. Solis-Ortega, “Swarm Robotics: Simulators, Platforms and Applications Review,” 
Computation, vol. 10, no. 6, p. 80, May 2022, doi: 10.3390/computation 10060080. 

[29] M. Garzón et al., “Using ROS in multi-robot systems: Experiences and lessons learned from real-world field tests,” in Studies in 
Computational Intelligence, vol. 707, Springer International Publishing, 2017, pp. 449-483. 


IAES Int J Rob & Autom, Vol. 13, No. 2, June 2024: 205-219 


IAES Int J Rob & Autom ISSN: 2722-2586 Oo 219 


[30] 
[31] 


[32] 


[33] 


[34] 
[35] 
[36] 
[37] 
[38] 


[39] 


C. Pinciroli et al., “ARGoS: A modular, parallel, multi-engine simulator for multi-robot systems,” Swarm Intelligence, vol. 6, no. 
4, pp. 271-295, Nov. 2012, doi: 10.1007/s11721-012-0072-5. 

M. U. Arif and S. Haider, “A Flexible Framework for Diverse Multi-Robot Task Allocation Scenarios Including Multi-Tasking,” 
ACM Transactions on Autonomous and Adaptive Systems, vol. 16, no. 1, pp. 1-23, Mar. 2022, doi: 10.1145/3502200. 

P. Fraisse, D. Andreu, R. Zapata, J. P. Richard, and T. Divoux, “Remote decentralized control strategy for cooperative mobile 
robots,” in 2004 8th International Conference on Control, Automation, Robotics and Vision (ICARCV), 2004, vol. 2, pp. 1011- 
1016, doi: 10.1109/icarcv.2004. 1468982. 

M. Dihya, M. Moufid, B. Chemseddine, and B. Moussaab, “Switched time delay control based on neural network for fault 
detection and compensation in robot,” JAES International Journal of Robotics and Automation (IJRA), vol. 10, no. 2, p. 91, Jun. 
2021, doi: 10.11591/jra.v10i2.pp91-103. 

W. Labunski and A. Burghardt, “Software for the Control and Monitoring of Work of a Collaborative Robot,” Journal of 
Automation, Mobile Robotics and Intelligent Systems, vol. 2021, no. 3, pp. 29-36, May 2021, doi: 10.14313/JAMRIS/3-2021/16. 
Y.-C. Wu, J.-W. Lee, and H.-C. Wang, “Robots for search site monitoring, suspect guarding, and evidence identification,” JAES 
International Journal of Robotics and Automation (IJRA), vol. 9, no. 2, p. 84, Jun. 2020, doi: 10.1159 1/ijra.v912.pp84-93. 

M. G. Ball, B. Qela, and S. Wesolkowski, “A review of the use of computational intelligence in the design of military surveillance 
networks,” in Studies in Computational Intelligence, vol. 621, Springer International Publishing, 2016, pp. 663-693. 

L. Joseph and J. Cacace, “Mastering ROS for robotics programming: design, build, and simulate complex robots using Robot 
Operating System,” 2015. 

M. Otte, M. J. Kuhlman, and D. Sofge, “Auctions for multi-robot task allocation in communication limited environments,” 
Autonomous Robots, vol. 44, no. 3—4, pp. 547-584, Jan. 2020, doi: 10.1007/s10514-019-09828-5. 

D. Tihanyi, Y. Lu, O. Karaca, and M. Kamgarpour, “Multi-robot task allocation for safe planning against stochastic hazard 
dynamics,” Jun. 2023, doi: 10.23919/ECC57647.2023.10178126. 


BIOGRAPHIES OF AUTHORS 


Abderrahmane Tamali © ÉJ I> received a master’s degree in Telecommunications 
Systems from Tahri Mohammed Bechar University. Currently pursuing a Ph.D. in Networks 
and Telecommunications at Ferhat Abbas Setif1 University, he excels in both substantive and 
group work environments. He possesses a diverse skill set, including programming, electronic 
application development, manipulation of Linux environments, development of robotic 
machines, utilization of ROS framework, and IoT applications. In addition to his academic 
achievements, Abderrahmene holds a senior IT technician diploma with a specialization in 
computer network maintenance. He is proficient in computer networks and is adept at 
troubleshooting and maintaining network systems. He can be contacted via email at 
abdou.t10@ gmail.com or abderrahmane.tamali @univ-setif.dz. 


Nourredine Amardjia © E I> received an engineering degree in electronics in 1982 
from ENPA (Ecole Nationale Polytechnique d'Alger), Algeria, his Master of Science degree in 
electrical engineering in 1985, from Fairleigh Dickinson University, New Jersey, USA, and 
his State Doctorate (Ph.D.) in communications in 2007 from University of Setif, Algeria. He 
joined the Electronics Department, University of Setif as an assistant professor in 1986. He 
was granted the level of associate professor in 2007. He is a member of the LIS laboratory, the 
University of Setif. His research interests include discrete transforms, image processing, filter 
design techniques, systolic architectures, and fast algorithms for signal processing 
applications. He can be contacted at amardjianour @univ-setif.dz. 


Mohammed Tamali © &:4 EJ © graduated from USTO-MB as a state engineer in Electrical 
engineering, he received his M.Sc. in 1996 in Energetical physics from Bechar University and 
the Ph.D. degree from UST-MB of Oran, Algeria in 2007. In 2013, he became a professor in 
Electrical Engineering. He is head of the SimulIA research team at the ENERGARID Lab. His 
fields of interest included power electrical systems, scientific computing tools, sustainable 
development, environmental studies applied to distributed electrical network optimization, and 
system theory application on power systems. He worked as a research professor at the University 
of Bechar from 1986 until today. He can be contacted at tamali: mohammed @univ-bechar.dz. 


Distributed and autonomous multi-robot for task allocation and collaboration ... (Tamali Abderrahmane) 


